Picture for Jongse Park

Jongse Park

Characterization of Multi-Model Agentic AI Systems on General Tasks via Trace-Driven Simulation

Add code
Jun 01, 2026
Viaarxiv icon

LLMServingSim 2.0: A Unified Simulator for Heterogeneous and Disaggregated LLM Serving Infrastructure

Add code
Feb 26, 2026
Viaarxiv icon

Neo: Real-Time On-Device 3D Gaussian Splatting with Reuse-and-Update Sorting Acceleration

Add code
Nov 17, 2025
Figure 1 for Neo: Real-Time On-Device 3D Gaussian Splatting with Reuse-and-Update Sorting Acceleration
Figure 2 for Neo: Real-Time On-Device 3D Gaussian Splatting with Reuse-and-Update Sorting Acceleration
Figure 3 for Neo: Real-Time On-Device 3D Gaussian Splatting with Reuse-and-Update Sorting Acceleration
Figure 4 for Neo: Real-Time On-Device 3D Gaussian Splatting with Reuse-and-Update Sorting Acceleration
Viaarxiv icon

LLMServingSim2.0: A Unified Simulator for Heterogeneous Hardware and Serving Techniques in LLM Infrastructure

Add code
Nov 10, 2025
Figure 1 for LLMServingSim2.0: A Unified Simulator for Heterogeneous Hardware and Serving Techniques in LLM Infrastructure
Figure 2 for LLMServingSim2.0: A Unified Simulator for Heterogeneous Hardware and Serving Techniques in LLM Infrastructure
Figure 3 for LLMServingSim2.0: A Unified Simulator for Heterogeneous Hardware and Serving Techniques in LLM Infrastructure
Figure 4 for LLMServingSim2.0: A Unified Simulator for Heterogeneous Hardware and Serving Techniques in LLM Infrastructure
Viaarxiv icon

Cocoon: A System Architecture for Differentially Private Training with Correlated Noises

Add code
Oct 08, 2025
Viaarxiv icon

Déjà Vu: Efficient Video-Language Query Engine with Learning-based Inter-Frame Computation Reuse

Add code
Jun 17, 2025
Viaarxiv icon

MixDiT: Accelerating Image Diffusion Transformer Inference with Mixed-Precision MX Quantization

Add code
Apr 11, 2025
Viaarxiv icon

Oaken: Fast and Efficient LLM Serving with Online-Offline Hybrid KV Cache Quantization

Add code
Mar 24, 2025
Figure 1 for Oaken: Fast and Efficient LLM Serving with Online-Offline Hybrid KV Cache Quantization
Figure 2 for Oaken: Fast and Efficient LLM Serving with Online-Offline Hybrid KV Cache Quantization
Figure 3 for Oaken: Fast and Efficient LLM Serving with Online-Offline Hybrid KV Cache Quantization
Figure 4 for Oaken: Fast and Efficient LLM Serving with Online-Offline Hybrid KV Cache Quantization
Viaarxiv icon

LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale

Add code
Aug 10, 2024
Viaarxiv icon

DaCapo: Accelerating Continuous Learning in Autonomous Systems for Video Analytics

Add code
Mar 21, 2024
Viaarxiv icon